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Abstract. The properties characterizing Sturmian words are considered for words on multiliteral al- 
phabets. We summarize various generalizations of Sturmian words to multiliteral alphabets and enlarge 
the list of known relationships among these generalizations. We also collect many examples of infinite 
words to illustrate differences in the generalized definitions of Sturmian words. 

1. Introduction 

Sturmian words, i.e., aperiodic words with the lowest factor complexity, appeared first in the paper of 
Hedlund and Morse in 1940. Since then Sturmian words have been in the center of interest of many math- 
ematicians and the number of discoveries of new properties and connections keeps growing. The charm 
of Sturmian words consists in their natural appearance while studying diverse problems. Many equiva- 
lent definitions have been found that way. Sturmian words are binary and every property characterizing 
Sturmian words asks for a fruitful extension to an analogy on a larger alphabet. Well-known examples of 
such efforts are Arnoux-Rauzy words, words coding interval exchange transformations, or billiard words. 
All these words belong to well established classes and their descriptions and properties can be found in 
many works [6, 36, 40, 27, 5, 11, 46]. An overview of some generalizations of Sturmian words is provided 
in [12] and [50]. 

The aim of this paper is to attract attention to other generalizations of Sturmian words. Our motivation 
stems from recent results on palindromes in infinite words that have ended in the definition of words rich 
in palindromes, the definition of defect, the description of a relation between factor and palindromic 
complexity, etc. [3, 15, 7[. Impulses for such an intensive research of palindromes come concededly from 
the article [22] which characterizes Sturmian words by palindromes, the article [23] which investigates 
the number of palindromes in prefixes of infinite words and last, but not least, the discovery of the role of 
palindromes in description of the spectrum of Schrodinger operators with aperiodic potentials [32] . While 
generalizing Sturmian words we have taken into consideration the characterization of Sturmian words by 
return words from [49] and a recent definition of Abelian complexity [43, 42], which is closely connected 
with balance properties. 

We consider the following properties (k denotes the cardinality of alphabet A): 

(1) Property C: 

the factor complexity of u satisfies C(n) = (k — l)n + 1 for all n £ N. 

(2) Property £7e: 

u contains one left special and one right special factor of every length. 

(3) Property BO: 

all bispecial factors of u are ordinary. 

(4) Property K: 

any factor of u has exactly k return words. 

(5) Property V: 

the palindromic complexity of u satisfies Vin) + Vin + 1) = k + 1 for all n G N. 

(6) Property V£: 

every palindrome has a unique palindromic extension in u. 

(7) Balance properties: 
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(a) Property By. 

u is aperiodic and for all a G A and for all factors w,v 6 £(u) with \w\ — \v\ it holds 

\\w\ a - \v\ a \ < fc-1. 

(b) Property Bg: 

u is aperiodic and there exists a E A such that for all factors w,v G £(u) with |iu| = |w| it 
holds 

|Ha-Ma|<fe-l- 

(c) Property .AC: 

u is aperiodic and the abelian complexity of u satisfies .AC(n) = k for all n G N, n > 1. 

All properties are equivalent on a binary alphabet and they characterize Sturmian words. No two 
of them are equivalent on the set of infinite words over a multiliteral alphabet. The non-equivalence is 
shown by counterexamples. However some properties imply others, or it can be shown that a couple 
of properties are equivalent on a certain class of infinite words. For instance, on the class of uniformly 
recurrent ternary words Properties 1Z and BO are equivalent. 

There exist more equivalent definitions of Sturmian words, for instance the definition based on balance 
properties of subfactors of factors [25], on the index of an infinite word [38], or Richomne's characteristics 
of Sturmian words [41]. We do not pay attention to these definitions in our survey. 

The paper is organized as follows. In section 2 we recall the notions playing an important role in the 
definitions of Properties 1 through 7. We recall the notion of substitution which is irrelevant for the 
generalizations of Sturmian words but is used to construct most of examples of infinite words. Section 3 
is focused on the study of palindromes in infinite words: we summarize older and new results concerning 
palindromes, we define palindromic branches. A new result in this section is Theorem 11 providing a new 
characterization of rich words by means of bilateral orders. Section 4 shortly summarizes essential results 
on Sturmian words. Section 5 is devoted to an overview of known relations among different generalizations 
of Sturmian words, mostly from articles [7, 16, 30, 9, 43, 42]. New results are in Theorems 22 and 26 
and Corollaries 24 and 25. The last section is a brief summary of selected relations and examples 
illustrating the studied Properties. 

2. Notations and definitions 

By A we denote a finite set of symbols, usually called letters; the set A is therefore called an alphabet. 
A finite string w — woWi . . .w n -i of letters of A is said to be a finite word, its length is denoted by 
|io| = n. Finite words over A together with the operation of concatenation and the empty word e as the 
neutral element form a free monoid A*. The map 

W = W Wl .. .W n -l l-> W = W n -iW n -2 ■ ■ -w a 

is a bijection on A* , the word w is called the reversal or the mirror image of w. A word w which coincides 
with its mirror image is a palindrome. 

Under an infinite word u over the alphabet A we understand an infinite string u = 110111U2 ... of letters 
from A such that every letter of A occurs in u. We call an infinite word u eventually periodic if there 
exist finite words w,v such that u = wv u , where lj means 'repeated infinitely many times'. If w = e, 
then u is said to be (purely) periodic. If u is not eventually periodic, then we call u aperiodic. 

A finite word w is a factor of a word v (finite or infinite) if there exist words and such that 
v = w^ww^K If u/ 1 ) = e, then w is said to be a prefix of v, if = e, then w is a suffix of v. We say 
that a prefix or a suffix is proper if it is not equal to the word itself. 

The language £(u) of an infinite word u is the set of all its factors. The factors of u of length n form 
the set denoted by £„(u). Using this notation, we may write £(u) = U n(£ N>Cn(u). 

We say that the language £(u) is closed under reversal if £(u) contains with every factor w also its 
reversal w. 

An infinite word u over A is called c-balanced if for every a € A and for every pair of factors w, v 
of u of the same length |w| = \v\, we have \\w\ a — \v\ a \ < c, where \w\ a means the number of letters 
a contained in w. Note that in the case of a binary alphabet, say A = {0, 1}, this condition may be 
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rewritten in a simpler way: an infinite word u is c-balanced, if for every pair of factors w, v of u with 
\w\ = \v\, we have ||w|o — \v\o\ < c. We call 1-balanced words simply balanced. 

We say that two words w,v E A* are abelian equivalent if for each letter a G A, it holds |w| a = \v\ a . 
It is easy to see that the abelian equivalence defines indeed an equivalence relation on .4*. If A = 
{a 1; a 2 , . . . , afe}, then the Parikh vector associated with the word w G A* is defined as 

*H = (\ w \ ai ,\w\a 2 ,---,\w\a k )- 

We call abelian complexity (as defined in [42]) of an infinite word u the function AC : N — >• N given by 

AC(n) = #{*(«;) | w G £„(u)}. 

For any factor w G £(u), there exists an index i such that w is a prefix of the infinite word 
UiU i+ iu i+ 2 ■ ■ ■■ Such an index i is called an occurrence of w in u. If each factor of u has at least 
two occurrences in u, the infinite word u is said to be recurrent. It can be easily shown that each factor 
of a recurrent word occurs infinitely many times. It is readily seen to see that if the language of u is 
closed under reversal, then u is recurrent. The infinite word u is said to be uniformly recurrent if for any 
factor wofu the distances between successive occurrences of w form a bounded sequence. 

Let j, k, j < k, be two successive occurrences of a factor w in u. Then UjUj + \ . . . Uk-\ is called a return 
word of w. Return words were first studied in [24] and [33]. The set of all return words of w is denoted 
by R(w), 

R(w) — {ujUj + \ . . . iifc_i | j, k being successive occurrences of w in u}. 

If v is a return word of w, then the word vw is called a complete return word of w. It is obvious that an 
infinite recurrent word is uniformly recurrent if and only if the set of return words of any of its factors is 
finite. 

The (factor) complexity of an infinite word u is the map C : N 4 N. defined by C(n) = #£„(u). To 
determine the increment of complexity, one has to count the possible extensions of factors of length n. 
A left extension of w G C(u) is any letter a G A such that aw G £(u). The set of all left extensions of 
a factor w will be denoted by Lext(w). We will mostly deal with recurrent infinite words u. In this case, 
any factor of u has at least one left extension. A factor w is called left special (or LS for short) if w has 
at least two left extensions. Clearly, any prefix of a LS factor is LS as well. It makes therefore sense to 
define an infinite LS branch which is an infinite word whose all prefixes are LS factors of u. Similarly, 
one can define a right extension, a right special (or RS) factor, Rext(w), and an infinite RS branch which 
is a left-sided infinite word whose all suffixes are RS factors of u. 

We say that a factor w of u is a bispecial (or BS) factor if it is both RS and LS. The role of BS factors 
for the computation of complexity can be nicely illustrated on Rauzy graphs. 

Let u be an infinite word and n G N. The Rauzy graph T n of u is a directed graph whose set of vertices 
is £„(u) and set of edges is £ n+ i(u). An edge e G £ n+ i(u) starts in the vertex w and ends in the vertex 
v if w is a prefix and v is a suffix of e, see Figure 1. If the word u is recurrent, the graph T n is strongly 



WqWi ■ ■ ■ W n -iW n 



W = WqWi ■ ■ ■ W n -1 V = Wl • • ■ W n -\W r , 



Figure 1 . Incidence relation between an edge and vertices in a Rauzy graph. 

connected for every n G N, i.e., there exists a directed path from every vertex w to every vertex v of the 
graph. 

If the language £(u) of the infinite word u is closed under reversal, then the operation that to every 
vertex w of the graph associates its mirror image, the vertex w, and to every edge e associates e maps 
the Rauzy graph T n onto itself. 

The outdegree (indegree) of a vertex w G £„(u) is the number of edges which start (end) in w. Obviously 
the outdegree of w is equal to #Rext(w) and the indegree of w is #Lext(w). The sum of outdegrees over 
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all vertices is equal to the number of edges in every directed graph. Similarly, it holds for indegrees. In 
particular, for the Rauzy graph r n we have 

Y #Rext(» = C(n+1) = Y #Lext(w). 

The first difference of complexity AC(n) = C(n + 1) — C(n) is thus given by 

AC(n) = Yl (#RextH-l) = Y (#Lext(tu) - l) . 

A non-zero contribution to AC(n) in the left-hand sum is given only by those factors w € C n (u) for 
which #Rext(w) > 2, and for recurrent words, a non-zero contribution to AC(n) in the right-hand sum 
is provided only by those factors w E C n (u) for which #Lext(w) > 2. The last relation can be thus 
rewritten for recurrent words u as 

AC(n)= Yl (#RcxtH-l) = Y (#LextH-l). 

w£C n (u), w RS wGC rl (u) 1 w LS 

If we denote Bext(w) = {awb G £(u) | a, b e .4}, then the second difference of complexity A 2 C(n) = 
AC(n + 1) - AC(n) = C(n + 2) - 2C(n + 1) + C(n) is given by 

(2.1) A 2 C(n)= Y (#Bcxt(w) - #Rext(w) - #Lcxt(w) + 1) . 

u>e£„(u) 

Denote by b(w) the quantity 

b(w) := #Bext(w) - #Rext(w) - #Lext(w) + 1. 

The number b(w) is called the bilateral order of the factor w and was introduced in [18]. It is readily 
seen that if w is not a BS factor, then b(w) = 0. Bispecial factors are distinguished according to their 
bilateral order in the following way 

• if b(w) > 0, then w is a strong BS factor, 

• if b(w) < 0, then w is a weak BS factor, 

• if b(w) = then w is an ordinary BS factor. 

A substitution on A is a morphism (p : A* ^ A* such that there exists a letter a <E A and a non- 
empty word w E A* satisfying Lp(a) — aw and ip(b) ^ e for all 6 £ A Since a morphism satisfies 
ip(vw) = ip(v)ip(w) for all v 7 w £ „4*, any substitution is uniquely determined by the images of letters. 
Instead of classical (p(a) — w, we sometimes write a — >• w. A substitution can be naturally extended to 
an infinite word u = u M i u 2 • • • by the prescription (p(u) = Lp(uo)(p(ui)ip(u2) ■ ■ ■ An infinite word u is 
said to be a fixed point of the substitution (p if it fulfills u = <p(u). It is obvious that every substitution ip 
has at least one fixed point, namely lin^^oo ip n (a) (to be understood in the sense of product topology). 

3. Words opulent in palindromes 

In resemblance to the factor complexity C(n) of an infinite word u, let us define the palindromic 
complexity of u as the map V : N — > N given by 

V{n) = #{w e £ n (u)| w = w}. 

If a € A and w is a palindrome and awa G C(u), then awa is said to be a palindromic extension of w. 
The set of all palindromic extensions of w is denoted by Pext(w). 

Similarly as in the case of left special and right special branches, one can define a palindromic branch 
of u. 

Definition 1. Let u be an infinite word. A both-sided infinite word v = . . . U3f2^iWiW2^3 • • • is a palin- 
dromic branch with center £ of the word u if for every n £ N the word w n w„_i . . . V2V1V1V2 ■ ■ ■ u„_iu„ is a 
factor o/u. Let a be a letter. A both-sided infinite word v = . . . v 3 V2ViaviV 2 v 3 . . . is a palindromic branch 
with center a of the word u if for every neN the word v„v„_i . . . v 2 viaviv 2 ■ ■ ■ v n ^\V n is a factor of u. 
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It follows from the Konig's theorem that if u has infinitely many palindromes then u has at least one 
palindromic branch. In any Sturmian word on {0, 1} there exist exactly three palindromic branches with 
centers e, and 1. See also Section 5.1. 

Uniformly recurrent words containing infinitely many distinct palindromes satisfy that for any factor w, 
every sufficiently large palindrome in u contains w, thus such a palindrome contains w as well. As 
a consequence, we have the following theorem. 

Theorem 2. If u is a uniformly recurrent word that contains infinitely many distinct palindromes, then 
its language C(u) is closed under reversal. 

The opposite implication is not true as illustrated by the following example. 

Example 1 (uniform recurrence + closeness under reversal =f> infinitely many palindromes) . The infinite 
word u on {a, b} (constructed in [13]) whose prefixes u n are given by the following recurrent formula 

u = ab, u n+ i = u n abu^, 

is uniformly recurrent and its language is closed under reversal. However, u contains only a finite number 
of palindromes. 

When we relax the condition of uniform recurrence, the statement of Theorem 2 is not true any more. 

Example 2 (infinitely many palindromes closeness under reversal). The infinite word u on {a, b, c} 
whose prefixes u n are given by the following recurrent formula 

u = e, u n+ i = u n abc n+1 u n 

is clearly recurrent. Infinitely many palindromes are represented by the factors c" for every n. As the 
factor ba does not occur, the set of factors is not closed under reversal. 

The word u may be recoded to a binary alphabet while preserving the mentioned properties. We may 
for instance recode u using the following mapping: 

a ->■ 0110, b -> 1001, c-> 1. 

An interesting relation between the palindromic and factor complexity has been revealed in [7]. 

Theorem 3. Let u be an infinite word with the language C(u) closed under reversal. Then 

(3.1) V(n + 1) + V{n) < AC(n) + 2 for all n e N. 

In fact, the above relation is stated in [7] for uniformly recurrent words, however the proof requires 
only recurrent words. Theorem 3 implies that infinite words reaching the equality in (3.1) are in a certain 
sense opulent in palindromes. Another measure of opulence in palindromes has been provided in [23]. 

Theorem 4. Every finite word w contains at most \w\ + 1 palindromes (including the empty word). 

Definition 5. An infinite word u satisfying that every factor w of u contains \w\ + 1 palindromes is 
called rich in palindromes. 

The following equivalent definitions of richness have been proved in [30], [16], [17], respectively. 

Theorem 6. For any infinite word u the following conditions are equivalent: 

(1) u is rich, 

(2) any return word of a palindromic factor of u is a palindrome, 

(3) for any factor w of u, every factor of u that contains w only as its prefix andw only as its suffix 
is a palindrome, 

(4) each factor ofu is uniquely determined by its longest palindromic prefix and its longest palindromic 
suffix. 

We will need for our further purposes an implication that holds only for languages closed under reversal. 

Corollary 7 ([16]). Let u be a rich infinite word with the language closed under reversal. Then for any 
factor w of u, the occurrences of w and w alternate. 
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A natural question is whether infinite words reaching the equality in (3.1) coincide with rich words. 
The following theorem proved in [16] for uniformly recurrent words, however valid even for infinite words 
with the language closed under reversal, provides an answer. 

Theorem 8. Let u be an infinite word with the language £(u) closed under reversal. Then u is rich if 
and only ifV(n + 1) + V(n) = AC(ra) + 2 for all n e N. 

Let us explain that Theorem 8 is slightly stronger than the equivalence of richness and the equality 
in (3.1) for uniformly recurrent words, proved in [16]. In other words, the following statement is a corollary 
of Theorem 8. 

Corollary 9. Let u be a uniformly recurrent infinite word. Then u is rich if and only ifV(n+l)+V(n) = 
AC(n) + 2 for all n e N. 

Proof. If C(u) is closed under reversal, then the statement follows from Theorem 8. If £(u) is not closed 
under reversal, then by Theorem 2, u contains only a finite number of palindromes. It is then readily 
seen that u is neither rich, nor the equality in (3.1) is attained for all n E N. □ 

Let us correct the following example given in [16]. The word u generated by the substitution a — > 
aba, b — > bb is recurrent, however not uniformly recurrent, and the language C(u) is closed under reversal. 
By inspection of the complete return words of palindromic factors, applying Theorems 6 and 8, it may be 
proved that the equality in (3.1) is attained. The authors of [16] claimed that V(2) + V(3) ^ AC (2) + 2. 
This mistake is however based on the fact that C(3) = 5 and not 6. 

Let us mention as an open problem the following question. "Does the equivalence of richness and the 
equality in (3.1) hold for a larger class than words with the language closed under reversal? For instance 
for all recurrent words?" 

The following observations may serve as hints: 

• It does not hold for non-recurrent infinite words in general. The infinite word ab^ is given in [16] 
as an example of a rich non-recurrent infinite word (with the language of course not closed under 
reversal), which does not reach the equality in (3.1) for all n 6 N. 

• Notice that both rich infinite words and infinite words reaching the equality in (3.1) contain 
infinitely many palindromes. 

• If u is rich and recurrent, then £(u) is closed under reversal (proved in [30], Proposition 2.11). 
The rest of this section is devoted to the relation between richness and bilateral orders of factors. The 

following proposition reveals some information on bilateral orders of palindromic bispecial factors in an 
infinite word with the language closed under reversal. 

Proposition 10. Let u be an infinite word whose language is closed under reversal. Then the bilateral 
order b(w) of a palindromic bispecial factor w <E £(u) has a different parity than the number of palindromic 
extensions ofw. 

Proof. Let w be a palindromic BS factor of u. On one hand, as the language is closed under reversal, we 
have #Lext(w) = #Rcxt(w). Consequently, from the definition of bilateral order one can see that the 
parity of ffBcxt(w) is different from the parity of b(w). On the other hand, the parity of the number of 
palindromic extensions of w equals the parity of #Bext(u>) since for any a, b e A, if awb £ £(u), then 
bwa e £(u). □ 

In the sequel, we will state and prove a new equivalent definition of rich words by means of bilateral 
orders. 

Theorem 11. Let u be an infinite word with the language £(u) closed under reversal. Then u is rich if 
and only if any bispecial factor w of u satisfies: 

• if w is non-palindromic, then 

h(w) = 0, 

• if w is a palindrome, then 

b(w) = #Pext(u;) - 1. 



STURMIAN JUNGLE (OR GARDEN?) ON MULTILITERAL ALPHABETS 



7 



The following lemma will provide the most important tool for the proof of Theorem 11. 

Lemma 12. Let u be a rich infinite word whose language is closed under reversal. Then it holds for any 
bispecial factor w : 

• if w is non-palindromic, then 

b(w) > 0, 

• if w is a palindrome, then 

b(w) > #Pcxt(w) - 1. 

Proof. Let w be a non-palindromic BS factor. By the definition of b(w), we want to prove 

#Bcxt(w) > #Rext(u;) + #Lext(w) - 1. 

We will construct a bipartite oriented graph G having its set of vertices V defined as 

V = {wa\a G Rext(iu)} U {wa|o G Rext(w)} . 

There is an oriented edge from wa to wb if there exists a factor vb g £(u) such that wa is its prefix, wb 
is its suffix and factors w and TZJ occur each exactly once in vb. Furthermore, there is an oriented edge 
for wx to wy if there exists a factor vy g £(u) such that wx is its prefix, wy is its suffix and factors w 
and w occur each exactly once in v. 



w 



wb 



w a 



wa 



wb 



wa 



Figure 2. Incidence relation in the graph G. 

Due to Theorem 6, such a factor v is a palindrome. Therefore the existence of an edge from wa to wb 
implies awb g C(u), and so bwa G £(u), too. Analogously, if there is an edge from wx to wy, we have 
xwy G £(u). 

By Corollary 7, the occurrences of w and w alternate. Thus, to any factor of u corresponds a path in 
G. As u is recurrent, the graph G is strongly connected. 

As a consequence, the number of pairs of its vertices which are connected by an edge is greater than 
or equal to the number of its vertices minus 1. We have 

#Bext(w) > #Rext(w) + #Rcxt(uJ) - 1. 

Since Rext(w) = Lext(w) the proof of the first part is finished. 

Let w be a palindromic BS factor. Let us consider this time a graph G whose set of factors V is defined 

as 

V = {wa\a G Rext(w)} . 

There is an edge from wa to wb if there exists a factor vb G £(u) such that v is a complete return word 
to w that has wa as a prefix. As u is rich, v is a palindrome. Due to the recurrence of u, for every 
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awb G £(u), a ^ b, there exists an edge in G going from wa to wb. As the language is closed under 
reversal, the edge going from wb to wa is in G, too. Therefore 

# {awb G jC(u)|o 7^ b} — 2 x the number of pairs of distinct vertices connected by an edge. 

Owing to the recurrence of u, the graph G is strongly connected, thus the number of pairs of distinct 
vertices connected by an edge is greater than or equal to the number of vertices of G minus 1, which 
equals #Rext(w) — 1. We find 

#Bcxt(w) = # {awb G C(u)\a ^ b} + #Pcxt(w) > 2 (#Rext(w) - 1) + #Pext(io). 

As Rext(w) = Lext(ui), the statement is proved. 

□ 

Proof of Theorem 11. (<=): Let us show by mathematical induction that 

/\C{n) + 2 = V{n+l)+V{n) for all n G N. 

Since £(u) is closed under reversal, this means by Theorem 8 that u is rich. 

The assumption on bilateral orders and the fact that non-bispecial palindromic factors have a unique 
palindromic extension guarantee the following equality for all n G N: 

(3.2) A 2 C(n)= b(io)= E (#Pext(tu) - 1) = 7>(n + 2) - P(n). 

wEC n {u) toG£n(u) 

W — w 

For n = 0, we can write AC(0) + 2 = C(l)-C(0) + 2= #A+1. On the other hand we have V{1) + T(0) = 
#-4 + 1. 

Take JVeN. Assume AC(n) + 2 = Tin + 1) + Tin) holds for all n < N. Using the induction assumption 
and (3.2), we obtain 

AC(A) + 2 = (AC(N) — AC(N — 1)) + (AC(N — 1) + 2) 
= A 2 C(N -1) + (T(N -1)+T(N)) 
= (T(N + l)-T(N-l)) + (T(N-l)+T(N)) 
= T(N + 1)+T(N). 

(=>): Take n G N arbitrary. We will prove the statement of the theorem for all BS factors of length n. 
As u is rich and the language C(u) is closed under reversal, we have by Theorem 8 

AC(fc) + 2 = T(k + 1) + T(k) for all k G N. 

Applying this equality, we will deduce the form of A 2 C(n). 

A 2 C(n) = (AC(n + 1) + 2)-(AC(n) + 2) = {Tin + 2) + Tin + l))-(P(n + 1) + 7>(n)) = T{n+2)-T(n). 
Consequently, we obtain 

E h(w) = A 2 C(n) =T(n + 2) -T(n) = E (#Pext(tu) - 1) . 

wE.£n(u) iu€£n(u) 

■uj=TZT 

Palindromic factors that are not BS have obviously exactly one palindromic extension. Thus, we can 
rewrite the previous equality 

(3.3) E b H= E (#PextH-l). 

w—w.w BS 

Let us split the sum of bilateral orders into two parts and use Lemma 12 

(3.4) ]T b(w)= E bH+ E b H> E b H+ E (#PcxtH-i). 

W€;C n (u) w£C n (\l) w€:Cn(u) WGC n (u) w€C n (u) 

w^w, w BS w=w, w BS w^w, w BS w=w, w BS 
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This in combination with (3.3) gives \ b(w) = 0. By Lemma 12, bilateral orders of such factors 

«>e£„(u) 

w^w, w BS 

are non-negative, which implies b(w) = for all non-palindromic BS factors. Since the equality is reached 
in (3.4), we obtain ^ b(w) = (#Pcxt(w) — 1) . Together with Lemma 12, this results 

u>e£„(u) weCn(u) 

w—w. w BS w—w, w BS 

in b(w) = #Pext(u>) — 1 for all palindromic BS factors. □ 

4. Equivalent definitions of Sturmian words 

Let us stress a close link between periodicity and complexity (revealed by Hedlund and Morse [31]). 
On one hand, the complexity of eventually periodic words is bounded. On the other hand, if there exists 
neN such that C(n) < n, then the complexity is bounded and the infinite word u is eventually periodic. 
In consequence, the complexity of aperiodic words satisfies C{n) > n + 1 for all n £ N. Sturmian words 
are defined as infinite words with the complexity C(n) = n+ 1 for all n £ N. This condition on complexity 
implies many properties. Let us list some of them. If u is a Sturmian word, then u has the following 
properties: 

• u is a binary word, 

• u is aperiodic, 

• the language C(u) is closed under reversal, 

• the language C(u) contains infinitely many palindromes, 

• the word u is uniformly recurrent, 

• the language £(u) contains no weak bispecial factors, 

• u is rich. 

There exist many equivalent definitions of Sturmian words. The following theorem summarizes several 
of their well-known combinatorial characterizations. 

Theorem 13. Let u be an infinite word over the alphabet A. The properties listed below are equivalent: 
(i) u is Sturmian, i.e., C(n) = n + 1 for all n, 

(ii) u is binary and contains a unique left special factor of every length, 
(Hi) u is binary, aperiodic and every bispecial factor is ordinary, 

(iv) any factor of u has exactly two return words, 

(v) u contains one palindrome of every even length and two palindromes of every odd length, 

(vi) u is binary and every palindrome has a unique palindromic extension, 

(vii) u is aperiodic and balanced, 

(viii) u is aperiodic and AC(n) = 2 for all n £ N, n > 1. 

The characterization by return words is due to Vuillon [49] and the one by the abelian complexity 
is a consequence of the works by Coven and Hedlund [20]. The equivalent definition based on the 
balance property comes already from Hedlund and Morse [39] . The two equivalent properties concerning 
palindromes have been proved by Droubay and Pirillo [22]. Notice that the sixth property can be 
equivalently rewritten as 

V(n)+V(n+1) = 3 for all n £ N, 

and also as 

V(n + 2) = V{n) for all n £ N. 
Let us recall that V(0) = 1 since the empty word is considered to be a palindrome. 

5. Generalizations of Sturmian words 

We have seen that Sturmian words can be defined in many equivalent ways. As a matter of course, 
various generalizations to multiliteral alphabets have been suggested and studied. 
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5.1. Two well-known generalizations. The most studied generalizations are Arnoux-Rauzy words 
and words coding fc-interval exchange transformation. 

Arnoux-Rauzy words (or AR words for simplicity) are infinite words with the language closed under 
reversal and containing exactly one LS factor w of every length, and such that every LS factor has the 
same number k of left extensions, i.e., #Lext(w) = fc. Their alphabet A has k letters since the empty 
word has exactly k left extensions. AR words are aperiodic and satisfy C(n) = (k — l)n + 1 for all 
n E N. They have been defined and studied in [23], the following properties have been proved ibidem. 
The language of AR words contains infinitely many palindromes, they are uniformly recurrent, rich, and 
have only ordinary BS factors. AR words form a subclass of extensively studied episturmian words (see 
for instance [29]), defined as infinite words that have the language closed under reversal and contain at 
most one LS factor of every length. 

Another well-known generalization of Sturmian words is provided by words coding k-interval exchange 
transformation. Let us state their definition and then explain why such words generalize Sturmian words 
to fc-letter alphabets. Take positive numbers ot\, . . . ,etk such that J2i=i a i = 1- They define a partition 
of the interval I = [0,1) into k subintervals 

j-i 3 

T 3 = E a " X! a ») ' 3 = !' 2 > • • • ' k - 

i=l i=l 

The interval exchange transformation is a bijection T : I — > I given by the prescription 

T(x) = x + cj for all x G Ij, j G {1, 2, . . . , k}, 

where Cj are suitably chosen constants. Since T is a bijection, the intervals T(Ii),T(l2), ■ ■ ■ ,T(Ik) form 
a partition of I. The orders of T(Ij) in the partition define a permutation 7r : {1, 2, . . . , k} — > {1, 2, . . . , k} 
and this permutation ir determines uniquely the constants Cj. For instance, if the permutation ir is 
symmetric, i.e., ir = ( £ k 2 i 1 " then the transformation T is of the following form 

T(x) = x + ai — a, for x G Ij. 

i>j i<3 

The infinite word u = u uiU2 . . . over A = {ai, . . . , a^} associated with T is defined as 

u n := a j if T n (x) G Ij 

and is called a word coding k-interval exchange transformation (k-iet word for short). 

From the point of view of combinatorics on words, an important role is played by those transformations 
whose orbit for an arbitrary x E I is dense in /, i.e., the closure of {T n (x) | n G N} is the whole interval /. 
A sufficient condition for this property represents the so-called i.d.o.c. (consult [36]) and the irreducibility 
of the permutation ir. In the sequel, let us assume that T satisfies both of these properties. The fc-iet 
word is then uniformly recurrent, its language does not depend on the position of the starting point x, 
but only on the transformation T, its complexity satisfies C(n) = (k — l)n + 1 for all n E N and no BS 
factor is weak. 

The language of the fc-iet word u is closed under reversal if and only if the permutation ir is symmetric. 
In such a case, the language £(u) contains infinitely many palindromes and, as shown in [7], the equality 
in (3.1) is attained. Hence, according to Theorem 8, the fc-iet words are rich. It is easy to describe the 
infinite palindromic branches for such fc-iet words. The one with the empty word as its center is obtained 
as the coding of the orbit {T n (x)\n G Z} with the starting point x = 1/2 and the branch with the center 
dj E A as the coding of the orbit with the starting point x = X^j<j a i + a j/^- 

The fc-iet words provide a generalization of Sturmian words due to the well-known connection between 
Sturmian and mechanical words [37]. 

Theorem 14. Let u be an infinite word. Then u is Sturmian if and only if u is a 2-iet word with an 
irrational partition of the unit interval. 
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Recently, in [45], a different generalization of Sturmian sequences is considered. It in fact corresponds 
to a special subclass of fc-iet words given by coding a trajectory in a regular 2n-gon. 

5.2. Combinatorial generalizations. Let us write down and baptize the generalizations of properties 
from Theorem 13. We will then refer to them and study their relations. Let u be an infinite word over 
the alphabet A. Denote k = if A. 

(1) Property C: 

the factor complexity of u satisfies C(n) = (k — l)n + 1 for all n G N. 

(2) Property CTZ: 

u contains one left special and one right special factor of every length. 

(3) Property BO: 

all bispecial factors of u are ordinary. 

(4) Property ft: 

any factor of u has exactly k return words. 

(5) Property V: 

the palindromic complexity of u satisfies V(n) + V(n + 1) = k + 1 for all n G N. 

(6) Property VS: 

every palindrome has a unique palindromic extension in u. 

(7) Balance properties: 

(a) Property By. 

u is aperiodic and for all a G A and for all factors w,v G C(u) with \w\ — \v\ it holds 

\\w\a ~ \v\a\ <k-l. 

(b) Property £>a: 

u is aperiodic and there exists a G A such that for all factors w,v G C(u) with \w\ = \v\ it 
holds 

\\w\a ~ \v\a\ <k-l. 

(c) Property AC: 

u is aperiodic and the abelian complexity of u satisfies AC(n) = k for all n G N, n > 1. 

At first, let us mention which properties are satisfied by the two generalizations of Sturmian words 
from Section 5.1. AR words fulfill Properties: C,CTZ,BO,TZ,V,VS and k-iet words satisfy Properties: 
C, BO, TZ. If moreover the permutation defining the fc-iet word is symmetric, then these words have 
Properties V and VS. Property CTZ does not hold for fc-iet words. 

It follows directly from the definition that some Properties imply others. For instance, by (2.1) BO 
implies C. They are not equivalent as shown by the following example taken from [26]. 

Example 3 (C ^> BO). The infinite ternary word limbec (fi n {a), where ip(a) = ab, (p(b) — cab, ip{c) = 
ccab - a recoding of the Chacon substitution - has the complexity 2n + 1 for every n G N, but contains 
infinitely many strong and weak BS factors. 

In the sequel, we will show that no two of these properties are equivalent on a multiliteral alphabet. 
Concerning Properties By,B^ and AC, we will not treat them but in the last section since they are 
very restrictive, and consequently, satisfied only by a small class of infinite words. 

5.3. Property £1Z. Property CTZ does not characterize AR words since it is satisfied by a larger class of 
words. Infinite words with the language closed under reversal and satisfying Property CTZ coincide with 
extensively studied aperiodic episturmian words. Nevertheless, Property CTZ may be satisfied by words 
whose language is not closed under reversal, as illustrated in [23] by the following example. It shows also 
that Property CTZ does not guarantee Properties C, BO, TZ, V, VS. 

Example 4 (CTZ ^> closeness under reversal, C, BO, TZ, V , VS). If we construct an infinite word u so that 
we replace 6 with be in the Fibonacci word abaababaabaabab . . ., the fixed point of ip : a — > ab, b — > a, 
then be is a factor of C(u), however cb not. It is easy to see that such a word has still a unique infinite RS 
and a unique LS branch (the infinite word u itself). Consequently, Property CTZ is preserved. However, 
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both of these infinite special branches have only two extensions, hence Property C (and BO as well) fails. 
The factor c has only two return words caab and cab, hence Property TZ does not hold. Moreover, as u is 
uniformly recurrent and its language is not closed under reversal, it contains by Theorem 2 only a finite 
number of palindromes. Therefore, Properties V and V£ are not satisfied. 

On the other hand, observing fc-iet words, we learn that none of Properties C, BO, TZ, V , V£ imply CTZ. 
The problem to describe the class of infinite words with Property CTZ whose language is not closed under 
reversal requires a further study. 

5.4. Property TZ. Let us recall that infinite words with Property TZ are necessarily uniformly recurrent. 
If their language is not closed under reversal, then it cannot contain infinitely many palindromes by 
Theorem 2. Such words exist, as illustrated by the following example, therefore, Property TZ does not 
imply V . 

Example 5 (TZ =f> V). The fixed point u of <p, where ip(a) = aab, <p(b) = ac, <p(c) = a, contains bac, 
but cab is not its factor. The fact that every factor of u has three return words is explained in [9] for 
a whole class of infinite words coding /3-integers. 

We have seen that AR words and fc-iet words have both Property TZ and C, however, as shown in [26] 
by the following example, Property C does not imply Property TZ on multiliteral alphabets. 

Example 6 (C 7$- TZ). The fixed point of (p : a — > ab, b — >• cab, c — > ccab - the above mentioned recoding 
of the Chacon substitution - has the complexity 2n + 1 for every n £ N, but contains more than three 
return words of certain factors (for example the factor be has 4 return words: bca, beca, bcaba and becaba. 

The following theorems come from the paper [9] that is devoted to the study of Property TZ for infinite 
words on multiliteral alphabets. Let us observe once more AR words and fc-iet words, these classes satisfy 
not only Property C, but also Property BO. It is thus natural to ask whether Property BO guarantees 
TZ. The corollary of the following theorem will provide an answer. 

Theorem 15. If u is an infinite word with no weak BS factors, then u has Property TZ if and only if u 
is uniformly recurrent and satisfies C. 

Let us underline, an infinite word u has Property BO if and only if it has Property C and contains no 
weak BS factors. It results in the advertised corollary. 

Corollary 16. Let u be a uniformly recurrent infinite word. Then 

BO^TZ. 

If we restrict our consideration to the ternary alphabet, the implication can be reversed. 
Theorem 17. Let u be a ternary uniformly recurrent infinite word. Then 

BO^TZ. 

As soon as the alphabet has more than three letters, Property TZ does not imply Property BO any 
more. 

Example 7 (TZ ^> BO). The uniformly recurrent infinite word u = lim^oo f n (a), where 

ip(a) = acbca, <p(b) = aebcadbdaca, ip(c) — dbcbdacadbd, ip(d) = dbcbd, 

satisfies TZ, but not C (since C(n) is even for all neN) and u contains, of course, weak BS factors. For 
details consult [9]. 

The question whether there exists a nice characterization of words with Property TZ on alphabets with 
more than three letters remains open. 
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5.5. Property V and V£. The paper [8] is focused on the study of Properties V and V£. As soon as 
an infinite word u has Property V£ , then u has exactly one infinite palindromic branch with center a 
for every letter a 6 A and one infinite palindromic branch with center e. Therefore, u contains exactly 
#A palindromes for every odd length (central factors of palindromic branches with centers a £ A) and 
one palindrome for every even length (central factor of the infinite palindromic branch with center e). 
Consequently, Property V is also satisfied by u. 

Let us recall that Property V may be reformulated in the following way 

(5.1) V{n + 2) = V{n) for all n E N, 

where P(0) = 1. We will equally use both of the forms of Property P. 

Let u be an infinite word satisfying VS. The language £(u) contains infinitely many palindromes, but 
it need not be closed under reversal, neither recurrent nor rich as illustrated by the following example. 

Example 8 (V£ 7^ closeness under reversal, V£ 7^ richness). The infinite word u on the alphabet 
{a, b, c} defined in the following way: 

u = c a^ccb ccc y a cccc b ccccc a cccccc b ccccccc a . . . 

2x 3x 4x 5x 6x 7x 

has three infinite palindromic branches with centers a, b and c 

. . . cccaccc . . . , ... cccbccc . . . , ... ccccccc . . . 

and one infinite palindromic branch with central factors of even length of the form . . . cccccccc . . . Indeed, 
u has the factor accb, however its mirror image bcca does not belong to the language C(u). Moreover, u 
is not rich since the prefix caccbccca of length 9 contains only 9 palindromes: 

e, a, 6, c, cc, cac, cbc 7 ccc and ccbcc. 

However, if the language £(u) is closed under reversal, then it is possible to say more about the relation 
of Properties V and C and the richness of u. When both V and C are satisfied, the equality in (3.1) is 
reached. Application of Theorem 8 provides us with the following corollary. 

Corollary 18. Let u be an infinite word whose language is closed under reversal. Then 

V + C richness of u. 

The first example shows that Property V itself does not guarantee richness even if the language is 
closed under reversal. The second one illustrates that the implication in Corollary 18 cannot be reversed. 

Example 9 {V£ 7^ richness, V£ ^> C). A known example of an infinite word with the language closed 
under reversal and with a higher factor complexity is the billiard sequence on three letters, for which 
C(n) = n 2 + n + 1. As shown in [14], such words satisfy Property V£, hence V as well. Consequently, 
billiard sequences do not reach the upper bound in (3.1) and by Theorem 8 cannot be rich. 

Example 10 (richness ^> V, richness 7^ C). Let ip be defined on an radetter alphabet as follows: 

<£(0) = 0*1, ^(1)=0*2, ...,ip(m-2) =0*(m- 1), ip(m - 1) = s , 

where s,teN and t > s > 2. The fixed point u of ip satisfies the equality V(n + 1) + V{n) = AC(n) + 2 
for all n. As the language is closed under reversal, by Theorem 8 u is rich. Property V is not satisfied 
since the sum V(n-\- 1) + V(n) is not constant. Further properties of palindromes in u can be found in [4]. 

Let us examine in the sequel the connection between Properties C and V, resp. C and V£. 

5.5.1. Ternary alphabet. Let us limit our considerations to the ternary alphabet. The following theorem 
and examples come from [8]. 

Theorem 19. Let u be an infinite ternary word with the language closed under reversal. Then 

(1) C^P, 

(2) BO VS. 
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The implication in Theorem 19 cannot be reversed. We have already illustrated in Example 9 that 
even the stronger property V£ does not ensure C. Let us provide one more counterexample - a fixed point 
of a substitution. 

Example 11 (V£ ^> C). Denote by u the infinite ternary word being the fixed point of the substitution 
$ defined by 

(5.2) <f>(a) = aba, $(&) = cac, $(c) = aca. 

Then the language of u is closed under reversal. On one hand, u has Property V£, consequently, u has 
Property V , too. On the other hand, Property C fails and £(u) contains infinitely many weak BS factors. 

Properties V and V£ are equivalent for binary words. However already for ternary words, the impli- 
cation V =>■ V£ does not hold any more. 

Example 12 (V ^> V£). Let v be the ternary infinite word defined by v = *(u), where * : {^4, B}* ->• 
{a, b, c}* is the morphism given by 

^(A) = be and = baa, 

and u is the fixed point of the substitution (p defined by 

ip(A) = ABB ABB A, tp(B) = ABA. 
Then v satisfies V, but does not satisfy V£. 

The relation between 1Z and V follows from Theorem 19 and Theorem 17. 
Corollary 20. Let u be an infinite ternary word with the language closed under reversal. Then 

Tl^>V£. 

The implication cannot be reversed. 

Example 13 (V£ ^> TV). Consider the fixed point u of the substitution in (5.2). As mentioned above, 
u contains weak BS factors. Then by Theorem 17, u does not satisfy 1Z. 

Putting together Theorems 19 and Corollary 18, we obtain one more corollary. 

Corollary 21. Let u be an infinite ternary word with the language closed under reversal. Then 

C => richness of u. 

In contrast with Corollary 18, we see that on a ternary alphabet already Property C itself ensures 
richness. 

Neither in this case, the reversed implication holds. Consult Example 10 or the following example 
with a periodic word. 

Example 14 (richness 7^ C). The periodic infinite word (abeba) 1 ^ is rich (since return words of palin- 
dromic factors are palindromes) and has a bounded complexity. 

5.5.2. Multiliteral alphabet. In this section, two new theorems concerning Properties V and V£ for mul- 
tiliteral infinite words will be proved. 

Theorem 22. Let u be an infinite word with the language closed under reversal. 

Assume C: V£ BO. 

Proof. (4=): Let us prove the statement by contradiction. Assume that Property BO holds and Property 
V£ does not. It is clear that the property V£ can only be violated on a palindromic BS factor. By 
Property BO, all palindromic factors have their bilateral order equal to zero. By Proposition 10, they 
have an odd number of palindromic extensions, particularly at least one. 

Since the language is closed under reversal, Theorem 3 implies the inequality (3.1) for all n E N 

V(n)+V(n+1) < 2 + AC(ra). 
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Let w denote the shortest palindromic BS factor that does not have exactly one palindromic extension. 
Denote N = \w\. Then we have for all n < N, 

V(n)+V(n + l) = #.4+1. 

Since Property BO implies Property C, we have 2 + AC(n) = 2 + (#A — 1), hence the equality in (3.1) 
is attained for all n < N. 

Since w has to have at least 3 palindromic extensions, one can see that V(N + 2) > V(N) + 2. Thus, we 
obtain V(N + 1) +T(N + 2) > V{N + 1) +T{N) + 2 = #.4 + 3 = AC(7V + l)+4, which is a contradiction 
with (3.1). We conclude that Property V£ holds. 

(=>): Assume Property V£ holds. Then Property V holds as well. By Corollary 18 u is rich. Consequently, 
we can apply Theorem 11 and we obtain b(w) = for all non-palindromic BS factors and b(w) = 
#Pext(u>) — 1 for all palindromic BS factors. By Property V£ every palindromic BS factor has a unique 
palindromic extension, thus b(w) — for palindromic BS factors, too. □ 

Let us deduce several Corollaries of Theorem 22. The most straightforward concerns richness and 
Property BO. It follows combining Theorems 22 and 8. 

Corollary 23. Let u be an infinite word with the language closed under reversal. Then 

BO => richness of u. 
Putting together Theorems 2, 15 and 22, we obtain the following corollaries. 
Corollary 24. Let u be a uniformly recurrent infinite word. 

Assume C: V£ => TZ. 

The reversed implication does not hold. Property TZ does not even guarantee the weaker property V . 

Example 15 (TZ + C ^> V). Consider again the infinite word from the previous section: the fixed point 
u of ip, where ip(a) — aab, ip(b) = ac, f(c) = a. Properties C and 1Z are satisfied (as explained in [9]), 
u is uniformly recurrent and the language £(u) is not closed under reversal. By Theorem 2, u contains 
only a finite number of palindromes. 

Notice that the assumptions in Corollary 24 imply that the language £(u) is closed under reversal. 
It is natural to ask whether the implication TZ => V£ holds for infinite words with the language closed 
under reversal. The answer is however negative. Property TZ does not imply even the weaker property V . 

Example 16. (7^ + reversal closeness ^ V) Consider again the uniformly recurrent infinite word from [9] 
given by u = lim n ^oo ip n (a), where 

tf(a) — acbca, <f(b) — acbcadbdaca, f(c) = dbcbdacadbd, <f(d) — dbcbd, 

It satisfies TZ, but C and BO are violated. It is not difficult to find infinitely many palindromes among 
weak BS factors. Thus, the language £(u) is closed under reversal. However V£ is not satisfied because 
cbc, dbd e C(u). Nor V holds since V{1) + V{2) =4^5. 

We see in the previous examples that to demand either only the closeness under reversal or only 
Property C in order to reverse the implication in Corollary 24 is not sufficient. It is however not solved 
whether any infinite word with the language closed under reversal and having Properties C and TZ satisfies 
Property V£ or at least V as well. 

Corollary 25. Let u be a uniformly recurrent infinite word. 

Assume V£: richness of u4^TZ. 
Proof. Recall that by Theorem 2, the language is closed under reversal. 

(=>): Suppose u is rich. Then Property V£ guarantees that Property V holds as well. Property V and 
the closeness of £(u) under reversal together with Theorem 8 implies C is also satisfied. The statement 
follows then by Corollary 24. 
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(<=): Let us prove the second implication by contradiction. Assume 1Z is satisfied and u is not rich. Theo- 
rem 6 claims that there exists a palindrome w which has a complete return word that is not a palindrome 
itself. As V£ holds, the language has #A + 1 biinfinite palindromic branches. As w is a palindrome, 
we can find it in the middle of one branch. Since u is uniformly recurrent, we can find roina bounded 
distance from the center (on both sides) of the remaining #A branches. Thus we have #A distinct 
palindromic complete return words of w. As w was supposed to have a non-palindromic return word, we 
have a contradiction with 1Z. □ 

In Theorem 22 for infinite words having Property C, we have proved that Property V£ coincides with 
Property BO. Under the same assumption on the complexity, we are again able to characterize Property 
V imposing this time a weaker condition on bilateral orders of BS factors. 

Theorem 26. Let u be an infinite word with the language closed under reversal and satisfying Property C. 
Then Property V holds if and only if any bispecial factor w of u satisfies: 

• if w is non-palindromic, then 

b(w) = 0, 

• if w is a palindrome, then 

b(w) = #Pext(w) - 1. 

Proof. (<=): Theorem 11 implies that u is rich. Since the language is closed under reversal, we can use 
Theorem 8. By Property C, we have V(n + 1) + V(n) = AC(n) + 2 = #A + 1, thus Property V holds. 

(=>): Corollary 18 states that u is rich. The statement about bilateral orders follows then by Theorem 11. 

□ 

This theorem may be immediately reformulated using Theorem 11. 

Corollary 27. Let u be an infinite word with the language closed under reversal. 

Assume C: V richness of u. 

Non-palindromic bispecial factors can really occur in infinite words with the language closed under 
reversal and satisfying Properties C and V£, thus V as well. This means that there exist rich words with 
non-palindromic BS factors. 

Example 17. A ternary word with such properties is v = 7r(u), where u = <p 2 {w) and 
tp : A -> CAC, B -> CACBD, C -> BDBCA, D BDB, 

7r : A — > ba, B ->■ b, C -> a, D -> abc. 

The substitution tp satisfies for any letter x £ {A, B, C, D}, if we cut off the last two letters of (p 2n (x), 
we get a palindrome. Together with the uniform recurrence of u, Theorem 2 implies that the language 
£(u) is closed under reversal. Every LS factor of u is a prefix of <p 2n (B) or ip 2n (C) for some n E N, 
consequently, AC(n) = 2 for all n £ N, n > 1. 

For every non-empty palindrome w G £(u), its morphic image tt(w) without first two letters is a palin- 
drome. As v contains infinitely many distinct palindromes and is a morphic image of a uniformly recurrent 
word, thus uniformly recurrent, too, the language £(v) is closed under reversal. The word v has two 
infinite LS branches: every LS factor of v is either a prefix of n((p 2n (B)) or of n(ip 2n (C)). Therefore, v 
satisfies Property C. Moreover, v contains only ordinary BS factors. Applying Theorem 22, Property V£ 
holds as well. Remark that the factor ba is a non-palindromic BS factor of v. 
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5.6. Balance properties. It is a direct consequence of the definition that 
(5.3) AC By B 3 . 

The first implication follows from the fact that if there are two factors v, w of the same length that contain 
a distinct number of letters a, say I and r, then there exist factors containing any number of letters a 
between I and r (they may be found in any factor having v as its prefix and w as its suffix). 

Let us point out that our favorite generalizations of Sturmian words, namely AR words and fc-iet words, 
violate the property By. The paper [19] provides a construction of an AR word u that is not c-balanced 
for any c. The same property have also all 3-iet words given by the transformation T associated with the 
symmetric permutation and verifying the property i.d.o.c, which can be shown using methods from [1]. 

It is natural to ask whether infinite words on multiliteral alphabets with Property AC exist. A recent 
answer has been provided in [21]: there are no infinite words satisfying AC on alphabets containing more 
than 3 letters. On the other hand, there exist ternary infinite words with Property AC as shown by the 
example taken from [42]. 

Example 18. Let v be any aperiodic infinite word on {A, B} and put u = 7r(v), where 7r is the morphism 
defined by n(A) = abc, n(B) = acb. Then AC(n) = 3 for all ngN, n > 1. 

A more general theorem has been proved ibidem. 

Theorem 28. If an aperiodic uniformly recurrent infinite word u on a ternary alphabet is 1-balanced, 
then u has Property AC . 

Let us underline in the following examples that none of the implications in (5.3) can be reversed. The 
first example comes from [43] and the second one is taken from [47]. 

Example 19 {By AC). The ternary Tribonacci word - the fixed point of the substitution p : a — > 
ab, b — > ac, c—t a - is 2-balanced, however its abelian complexity reaches five values: 3, 4, 5, 6, 7. Notice 
that the Tribonacci word belongs to AR words, which satisfy Properties C, CTZ, BO, 1Z, P, VE. 

Example 20 (Ba ^> By). The fixed point u of the substitution p : a — > aab, b — V c, c — > ab has the 
following properties (shown in [47]): 

• for any factors v,w G C(u) with \v\ — \w\, it holds 

\\v\ x - \w\ x \ < 2 if x G {b,c}, 

• there exist v, w e £(u) with \v\ — \w\ such that 

\\v\a - \w\ a \ = 3. 

Thus, u has Property £>a. The word u is a coding of distances between neighboring /3-integers, where (3 
is the largest root of the polynomial x 3 — 2x 2 — x + 1. It is moreover known (see [28]) to verify Property 
BO. but not CTZ. Theorem 15 implies that u has Property 1Z as well. Its language is not closed under 
reversal, consequently, neither V£ nor V holds. 

Generally, it is difficult to decide whether an infinite word has Property Bg or By- A slightly simpler 
problem is to study infinite words that are c-balanced for some c. The criterion for existence of such 
a constant c for fixed points of a primitive substitution has been provided in [2] , observing the spectra of 
adjacent matrices of substitutions. In general, it is however impossible to determine the minimal value 
of c from the spectrum. To our knowledge, besides the ternary words considered in Examples 19 and 20, 
the only non-sturmian fixed points of primitive substitutions, for which the minimal value of c is known, 
have been examined in [10] and [48]. 

6. Overview of relations and examples 

In this section we provide a brief overview of relations and examples presented in the paper. Most of 
the relations are depicted in Figure 3. Examples are listed in Table 1. The word is either a fixed point 
of the given substitution, the image by the morphism 7r of a fixed point of the substitution ip, the limit 
of the sequence (u n ) or otherwise specified. 
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implication 

> irreversible implication 



4 >■ equivalence 

/ )• invalid implication 

number #.4, 

UR uniform recurrence 

CuR closeness under reversal 

qq b(w) = /#P ex t(w^) — 1 if w is bispecial palindromic 

p 1 otherwise 



Figure 3. Diagram of known relations (assumptions are marked as labels of arrows) 
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word 


properties 


reference 


u = ab, u n+ i = u n abu^ 


uniformly recurrent, closed under reversal, 
finite number of palindromes 


ex. 1 on p. 5, 
[13] 


u Q = e, u n+1 = u n abc n+L u n 


recurrent, oo-many palindromes, not 
closed under reversal 


ex. 2 on p. 5 


a — > ab, b — > cab, c — > ccab 


C, not BO, not 1Z 


ex. 3 on p. 11, 
ex. 6 on p. 12, 
[26] 


tp: A -> AB, B ^ A; n: A ->• a, 
B — > &c 


CTZ, not closed under reversal, finite num- 
ber of palindromes, not C, not TZ 


ex. 4 on p. 11 


a flab, & — > ac, c — > a 


TZ, not closed under reversal 


ex. 5 on p. 12, 
[9] 


a — > acbca, b — > acbcadbdaca, c — > 
dbcbdacadbd, d — > d&c&d 


TZ, closed under reversal, not C, not V 


ex. 7 on p. 12, 
ex. 16 on p. 15, 
[9] 


u = ca^cc^b^ccc^accccbccccca . . . 

2x 3x 4x 5x 


oo-many palindromes, not closed under re- 
versal, not rich 


ex. 8 on p. 13 


billiard sequence on three letters 


closed under reversal, V£ , not C, not rich 


ex. 9 on p. 13, 
[14] 


a —7- aab, b — > aac, c — > aa 


rich, not C, not V 


ex. 10 on p. 13, 
[4] 


a — > a6a, & — > cac, c — > aca 


closed under reversal, V£, not C, not TZ 


ex. 11 on p. 14, 
ex. 13 on p. 14, 
[8] 


(p: A ^ ABB ABB A, B -> ASA; tt: 
A -> 6c, £? -> &aa 


closed under reversal, C,V, not V£ 


ex. 12 on p. 14, 
[8] 


{abcbaY 


rich, not C 


ex. 14 on p. 14 


a — > aab, b — > ac, c — > a 


C, TZ, not closed under reversal 


ex. 15 on p. 15, 
[9] 


A -> CAC, B -> CACBD, C -> 
BDBCA, D -> BDB; tt: A -> 6a, 
— > 6, C — > a, Z) -> a&c 


VE, C, closed under reversal, rich, contains 
non-palindromic BS factors 


ex. 17 on p. 16 


u = 7r(v), 7r: A — > afec, £> — > ac&, v 
is an aperiodic word over {A, £?} 


AC 


ex. 18 on p. 17, 
[42] 


a — >• a6, 6 — >• ac, c — >• a 


CTZ,BO,TZ,V£, B v , not AC 


ex. 19 on p. 17, 
[43] 


a — > aab, b — »• c, c — »• ab 


£>g, not B\/, not closed under reversal, BO, 
not CTZ, 1Z 


ex. 20 on p. 17, 
[47] 



Table 1. Example overview 
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